Global fluctuations and Gumbel statistics 
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We explain how the statistics of global observables in correlated systems can be related to extreme 
value problems and to Gumbel statistics. This relationship then naturally leads to the emergence of 
the generalized Gumbel distribution G a (x), with a real index o, in the study of global fluctuations. 
To illustrate these findings, we introduce an exactly solvable nonequilibrium model describing an 
energy flux on a lattice, with local dissipation, in which the fluctuations of the global energy are 
precisely described by the generalized Gumbel distribution. 
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The ubiquitous appearance of asymmetric distribu- 
tions in the study of fluctuations of global quantities in 
correlated systems has raised a lot of interest in recent 
years. Such non-Gaussian distributions, characterized by 
an exponential tail on one side and a rapid fall-off on the 
other side, have been observed in many models or experi- 
mental systems, in the context of turbulence 0,0, 03- CJ 01 
equilibrium critical systems P, 0, 0, S @, nonequilib- 
rium models exhibiting self-organized criticality |6J, [10| , 
interface models ,1// noi se fl3 , Langevin equations 
fl3j |. granular gas models 0, or even the statistics 
of the level of the Danube river Quite strikingly, 

this analogy is not only qualitative, but many of the dis- 
tributions observed in these very different systems actu- 
ally fall |l| |a, y, |bj, |l5| , once suitably rescaled, close to 
the so-called Bramwell-Holdsworth-Pinton (BHP) distri- 
bution describing the magnetization of the XY model in 
the low temperature limit, as well as the roughness of 
the two-dimensional Edwards- Wilkinson surface model 
161 . Yet, as not all data collapse onto the BHP curve 
a more general distribution has been proposed to 
describe the data, namely the generalized Gumbel distri- 
bution GiJx), which includes a continuous shape param- 
eter a [lEflEHBlHlIi-see below for definition. 
Interestingly, this distribution, with a = 1, turns out to 
be the exact one for periodic Gaussian 1 / f- noise Ilia : it 
is also very close to the BHP one for a « tt/2 0, Il4j . 

The distribution G a {x) originates, for inte ger values of 
a, from the study of extreme value statistics |lTj,|2fJ, and 
describes the fluctuations of the a th largest value in a 
large set of identically distributed (independent) random 
variables z% [2g. Accordingly, there is no obvious theo- 
retical motivation for the use of the distribution G a [x] in 
the study of fluctuations of global quantities. Rather, it is 
usually considered as a convenient fitting function, and a 
theoretical understanding of its relevance is still lacking. 
Indeed, the question of the underlying role of extreme val- 
ues in correlated systems has been repeatedly asked in the 
literature @, H [H EHH . Still, attempts to identify an 
extremal process dominating the dynamics of such sys- 
tems have failed up to now 0, ■ All the above body of 
results thus leads to the following questions. First, what 



is (if any) the precise relationship between global fluctua- 
tions in correlated systems and extreme value statistics? 
Second, could one find a simple physical model for which 
the fluctuations of a global quantity would be exactly 
described by a generalized Gumbel distribution? 

Global fluctuations in complex correlated systems are 
often hard to tackle analytically precisely due to strong 
correlations between local microscopic variables. Yet, in 
some cases, statistically inde pen dent collective variables 
-like Fourier modes 0,ll3i|22,|22|- can be defined, so that 
a problem of correlated random variables may be con- 
verted into a problem of independent random variables, 
with non-identical distributions -otherwise the central 
limit theorem would hold. 

In this Letter, we explain how the statistics of global 
quantities, expressed as sums of non-identically dis- 
tributed random variables, is related to extreme value 
problems, and how the generalized Gumbel distribution 
G a (x), with a real index a, emerges in the study of global 
fluctuations. Interestingly, it turns out that such a rela- 
tionship does not rely on an extremal process hidden in 
the dynamics of global variables, contrary to usual con- 
jectures. These results are illustrated on a nonequilib- 
rium cascade model in which the fluctuations of the total 
energy are exactly described by the generalized Gumbel 
distribution G a (x), where a depends continuously on the 
microscopic parameters of the model. 

Our starting point is the observation 0] that the inte- 
grated power spectrum w of periodic Gaussian 1// noise 
is distributed, after a suitable rescaling x = (w— (w))/a w 
(where erj is the variance of w) according to the Gumbel 
distribution Gi(x). The model for 1// noise used in 0] 
consists in a large number N of statistically indepen- 
dent Gaussian Fourier modes with complex amplitudes 
c n = c*_ n . Introducing y n = |c„| 2 + |c_„| 2 , one has by 



definition w = 2M- The distribution of y n reads 



Pn(y n 
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(1) 



so that w is simply the sum of N independent ran- 
dom variables y n , with non-identical exponential distri- 
butions. Note that the appearance of a non-Gaussian 
distribution is not surprising in itself, since the sum of 
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P(z) 




where the integral over z n is from z n+ i to oo, for 1 < 
n < N — 1. This expression can be understood as fol- 
lows: either the variables {z n } are already ordered, which 
straightforwardly gives the above integrals, or they are 
not, and then can be ordered through a permutation, 
which leads to the Nl factor in front. Making the change 
of variables v n = z n — z n +i (1 < n < iV — 1) and vn = zn 
in Eq. the different integrals factorize and one finds: 



FIG. 1: Sketch of the notations used in the text. N random 
values of z are drawn according to the probability density 
P(z). These values are relabelled into z[ > . . . > the 
interval between two successive z' n is denoted by y n ■ 



the variances of the y n 's converges when N — > oo [as 
var(y„) = l/(n 2 K 2 )], so that the central limit theorem is 
not expected to hold -for a detailed discussion on this 
point, see Still, the fact that a Gumbel distribu- 

tion precisely emerges may suggest the existence of some 
"hidden" extremal processes dominating the fluctuations 
of w, but no clear evidence for such processes has been 
found yet HEl- 

Actually, a different perspective may be necessary to 
understand the relationship between the two problems. 
Indeed, instead of looking for extremal processes hidden 
in sums of random variables, one may look for sums 
of random variables with decreasing amplitudes when 
studying the statistics of extreme values. To this aim, we 
introduce the following procedure, illustrated in Fig. ^ 
Considering a set of N random variables z n > (1 < n < 
N), all drawn from the same distribution P(z), we intro- 
duce the variables z' n defined by ordering the original 
variables z n : z' n = z a / n -\, where <j(n) is a permutation 



over the interval [1,N] such that 



> Z'n > 



> z' 



N ■ 



Thus z[ is simply the maximum value of the set {z n }. 
We also define the interval y n between z' n+1 and z' n : 



Vn = z' n - 4+i (1 < n < N - 1); y N = z' N 



With these notations, one can write: 



n 



max (zn) = - 1 — * 

Kn<N ^— ' 

n=l 



(2) 



(3) 



As a result, a problem of extreme value can be mapped 
onto a problem of sum of random variables. Still, it 
should be noticed that although the original variables 
z n are independent, the y n 's are a priori correlated. 

In the following, we show that in the specific case 
where P(z) is an exponential distribution P(z) = ne~ KZ , 
the y n 's prove independent and distributed according to 
Eq. The distribution fW({yn}) reads: 



' dz ie - KZl 



P N ({y n }) = k N\ / dz N e~ K * 
Jo 

N-l 

x 6(y N - z N ) \\_5[y n - (z n - z n+1 )} (4) 

n=l 



N 



P N ({y n }) = n 



(•5) 



Thus it turns out that in the specific case P(z) = ne~ KZ 
the y n 's are independent variables, distributed as the 
squared Fourier amplitudes in the 1// noise model, i.e., 
according to Eq. 0J. But as the sum of the y n 's is pre- 
cisely the maximum value of a set of exponentially dis- 
tributed variables {z„}, we know that this sum has to 
be distributed (after a suitable rescaling) according to 
G-\ ( x) , so that one recovers immediately the results of 
|12|. Accordingly, a clear relationship appears between 
the statistics of extreme values and that of sums of vari- 
ables with decreasing amplitudes. This relationship can 
actually be understood at two different levels. On the 
one hand, starting from a set of (possibly correlated) 
variables {z„}, one can always define the interval y n be- 
tween two successive variables z' n obtained by ordering 
the set {z n } -see Eq. (j3J. Thus, on general grounds, the 
maximum value of correlated variables z n can be formally 
written as a sum of correlated variables y n , but the corre- 
sponding extreme value distribution is usually unknown. 
On the other hand, it seems that the maximum value of 
a set {z n } of independent variables is related to a sum 
of independent variables {y n } only in the case where the 
ZnS are drawn from an exponential distribution, leading 
to the Gumbel distribution G\(x). Indeed, the factor- 
ization property of the exponential is essential to derive 
Eq. lrom Eq. Q. 

The above result leads to some rather unexpected con- 
sequences. From the very definition of the variables {4}, 
z' k is precisely the k th largest value of the original set 
{z n }. So we know that z' k follows, once rescaled as 
x = (z' k — (z' k ))/ak with a\ = vax(z' k ), the generalized 



Gumbel distribution Gk{x) JjJ. The distribution G a (x) 
is defined for any positive real value a by: 



G a{x) = -^-exp{-a 6 a (x + v a ) + , 
r(a) I L 

with 



d 2 lnT 
"da 5 ""' 



v a = 7T ( hi a 



9 a (x+u a ) 



rflnT 



} (6) 



da 



(7) 



where F(a) is the Euler Gamma function. Besides, 
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may also be expressed as a sum: 



N N-k+1 



(8) 



n—k 



with y n = y n _|_fe_x distributed according to 



Vfc(tf») = (« + *- l) K e~ ( " +fc - 1)KS " (9) 



Thus the sum of independent random variables drawn 
from 10 is distributed, after a suitable rescaling, accord- 
ing to Gk{x) in the limit N — > oo. But then, one can 
forget the original extreme value problem, and consider 
only the statistics of the sum, so that there is no more 
reason to restrict k to be integer. Since the generalized 
Gumbel distribution is obtained for integer k, it seems 
plausible that it also holds for real values k = a > 0. To 
be more specific, considering independent variables u n 
with distribution 



Pn 



, a {u n ) = (n + a-l)ne 



— (n+a— l)nu r 



1 < n < N (10) 



the sum X = Xm=i Un lSl P rec i ser y distributed, in the 
limit N — ► oo, according to the generalized Gumbel dis- 
tribution G a (x), where x = (X — (X))/ax- This result, 
suggested by the above argument, can be derived exactly 
without reference to the extreme value problem |25|| . 

We now illustrate the above result on a simple nonequi- 
librium stochastic model, which is defined by the fol- 
lowing rules [13] • On each site n = 1, . . . , N of a one- 
dimensional lattice, a positive continuous variable p n -to 
be thought of as an energy- is introduced. The (asyn- 
chronous) dynamics is defined through three different 
physical mechanisms involving energy, namely injection 
on -say- the left boundary, transport from one site to its 
right neighbor, and local dissipation. More precisely, an 
amount of energy between p, and p + dp can be either 
injected on the leftmost site n = 1 with a rate (probabil- 
ity per unit time) J(p)dp, transferred from site n to site 
n + 1 with rate (f>(p)dp, or removed (i.e., dissipated) from 
site n with rate A(p)dp -see Fig. On the rightmost 
site n = N, the transferred energy is actually dissipated. 
Note that the above rates do not depend on the values of 
the local energies p n , apart from the obvious constraint 
that one cannot withdraw from site n (either for trans- 
port or dissipation) an energy p greater than p n . 

The master equation governing the evolution of the 
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FIG. 2: Schematic view of the model, with the three different 
mechanisms: injection on the leftmost site with rate J(fii), 
transport from site i to site i + 1 with rate <f>(pi), and dissi- 
pation on site j with rate A(pj). 



probability distribution P({p n },t) reads: 
dP r pl 



— = / d(J,J(fj,)P({pi -fx,pj},t) 



dnJ(p)P{{ Po },t) 
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N-l 

E 

n=l 
N 



d[Mf>((J,)P({p n + M,Pn+l ~ H,Pj},t) 



iy poo 

V / d(i[A(n) + (f>(p,)5 niN ]P({p n +p,pj},i) 

n=l Jo 
N rPn 

dp[<f>(p) + A(p)]P({ P] },t) (11) 



where pj generically stands for all the variables that 
are not affected by p. In the following, we focus on 
the specific case where J(p) — e~ l3fJ '(p(p) and A(p) = 
(e XfJl ~l) 4>(p), introducing two positive parameters f} and 
A. With these assumptions, the steady-state distribution 
P(j Pn\) proves factorized and can be computed exactly 
28]; it turns out to be precisely the same as Eq. (|1( 



N 



P({Pn})= Yl^n + P), 



-(\n+f3)p n 



(12) 



with the identification A = k and (3 = (a — 1)k -note 
that P{{p n }) does not depend on the specific form of 
4>(p). As a result, the fluctuations of the total energy 
E = Yln=i P n are described in the infinite TV limit -after 
rescaling E to ensure zero mean and unit variance- by the 
generalized Gumbel distribution G a (x), with a — 1 + /3/A 
(see Fig. |3J|- Interestingly, in the limit of low dissipation 
A — > 0, one recovers a Gaussian distribution, since G a (x) 
converges to a Gaussian for a ~ -> oo. Qualitatively, the 
parameter a may be thought of as the number of sites 
having roughly the same energy, of the order of 1//3. 
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the dynamical rules should yield a global energy distri- 
bution which is still close to a Gumbel distribution. In 
addition, the present model may be useful to study other 
issues of nonequilibrium statistical physics, as there are 
very few known solvable models including dissipation. 

The author is grateful to I. Bena, M. Clusel, O. Dau- 
chot, M. Droz, P. Holdsworth, C. Mazza, F. van Wijland 
and Z. Racz for fruitful discussions and interesting com- 
ments on the manuscript. This work has been partially 
supported by the Swiss National Science Foundation. 
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FIG. 3: Distribution Pe(x) of the rescaled (global) energy 
of the model, x = (E — (E))/<je- The analytical result 
G a {x) (full lines) is compared with numerical simulations with 
00) = 1, for a = 1 + P/X = 1.7 (o), 3.3 (<), and 6 (o), show- 
ing an excellent agreement, fnset: zoom on the top of the 
curves, on a linear scale. 



Finally, we note that the 'cascade' mechanism illus- 
trated by the above stochastic model should be consid- 
ered as one possible mechanism, but perhaps not as the 
unique one. Indeed, in some systems like freely evolv- 
ing granular gases |LJ], Gumbel distributions are indeed 
observed even though the global quantity of interest can- 
not be written in an obvious way as a sum of independent 
collective variables. Yet, it must be noticed that such a 
granular system does not reach a steady state since no 
energy is injected; fluctuations are then measured in a 
scaling regime where the average kinetic energy continu- 
ously decreases. Accordingly, one might expect another 
physical mechanism to be at play in this case. 

In summary, we have shown that the generalized Gum- 
bel distribution G a (x) appearing in numerous experimen- 
tal and numerical studies should not be interpreted as a 
signature of some hidden extremal process, but on the 
contrary, as the distribution associated to an infinite sum 
of independent and exponentially distributed random 
variables u n (n > 1), with mean value [(n+a— . If a 
is integer, the variables u n can be interpreted as the inter- 
vals y n between two successive (ordered) random values 
drawn from an exponential distribution P(z) = ne~ KZ , so 
that the a th largest value among the z„'s can be written 
as the sum of the y„'s for n > a. Thus a clear connection 
between global fluctuations and extreme value statistics 
has been established. Besides, we have proposed a sim- 
ple nonequilibrium model, defined through microscopic 
stochastic rules, for which the natural global quantity is 
exactly described by the generalized Gumbel distribution 
G a (x), with a > 1 a real value related to the parameters 
of the model. Such a simple model might be considered 
as a kind of 'ideal' model, that may be extended in several 
directions to describe in a more precise way some realistic 
systems. In particular, one expects that changing slightly 
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